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dencies  but  differs  from  the  standard  notion  for  equality-generating  dependencies. 
It  is  argued  that  neither  approach  is  the  correct  one,  but  rather  that  they 
correspond  to  different  policies  on  constraint  enforcement,  and  each  one  is 
appropriate  in  different  circumstances.  Consistency  and  completeness  of  a  state  arc 
characterized  in  terms  of  the  tableau  associated  with  the  state  and  in  terms  of  logi¬ 
cal  properties  of  a  set  of  first-order  sentences  associated  with  the  state.  A  close  rela¬ 
tion  between  the  problems  of  testing  for  consistency  and  completeness  and  of  test¬ 
ing  implication  of  dependencies  is  established,  leading  to  lower  and  upper  bounds 
for  the  complexity  of  consistency  and  completeness,  'flic  possibility  of  formalizing 
dependency  satisfaction  without  using  a  universal  relation  scheme  is  examined. 
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1.  Introduction 

The  starting  point  of  dependency  theory  is  the  notion  that  certain  database  states  are  legal,  or 
correct,  and  others  are  not.  The  class  of  formal  statements  called  dependencies  is  studied  as  a 
language  for  the  specification  of  the  allowable  states.  It  is  therefore  of  great  importance  to  have  a 
precise  notion  of  when  a  database  state  satisfies  a  set  of  dependencies. 

When  the  database  state  consists  of  a  single  relation,  dependency  theory  docs  provide  an  ade¬ 
quate  notion  of  satisfaction.  Dependencies  are  restated  as  first  order  sentences  on  a  language  with  a 
single  predicate  letter,  and  a  relation  satisfies  a  set  of  dependencies  if  it  provides  a  model  for  the 
associated  set  of  sentences.  The  generalization  of  this  notion  to  multi-relation  databases  has  until 
recently  followed  one  of  two  paths.  In  the  first  approach,  a  dependency  is  said  to  hold  in  a  particu¬ 
lar  relation,  so  inter-relation  dependencies  arc  eliminated  by  definition.  The  second  approach  con¬ 
veniently  assumes  that  the  database  consists  of  the  set  of  projections  of  some  universal  relation; 
questions  of  satisfaction  arc  dealt  with  by  examining  this  universal  relation. 

More  recent  work  [H,  V]  has  pointed  out  the  limitations  of  these  approaches  and  proposed  a 

more  appealing  one.  A  database  state  is  said  to  satisfy  a  set  of  functional  dependencies  if  there 

exists  a  universal  relation  /  such  that  /  satisfies  the  dependencies  and  is  a  containing  instance,  that 

is,  the  projections  of  /  on  the  given  relation  schemes  contain  each  of  the  given  relations.  Any  such 

^  Research  supported  by  a  Wcizmann  Post-Doctoral  I'cllowship  and  AI'OSR  j EwHHHasw**  A  Stress:  IBM 
Research  Laboratory,  5000  Cottle  Rd.,  San  Jose  CA  95 193. 
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/  is  called  a  weak  instance  for  die  database  state;  this  notion  is  often  called  weak  satisfaction. 
Detailed  justification  and  discussion  of  this  approach  can  be  found  in  [G,H,V].  it  is  worth  noting 
that,  when  only  functional  dependencies  arc  present,  a  single-relation  database  is  weakly  satisfying 
precisely  when  it  is  satisfying  in  the  standard  sense. 

Given  the  useful  properties  of  weak  satisfaction  [G,M,S,Y],  it  is  tempting  to  generalize  it 
directly  to  other  kinds  of  dependencies  by  saying  that  a  database  state  satisfies  a  set  of  (functional, 
multivalued,  join,  template,  etc.)  dependencies  if  there  exists  a  containing  instance  for  it  that 
satisfies  the  dependencies.  In  this  case  we  shall  say  that  the  state  is  consistent  with  the  dependen¬ 
cies. 

Our  first  objection  to  this  proposal  is  that  weak  satisfaction  now  becomes  different  from  stan¬ 
dard  satisfaction  for  single-relation  databases.  For  example,  if  all  the  dependencies  arc  total  tuple- 
generating  dependencies  (such  as  multivalued  and  join  dependencies),  then  any  database  state 
satisfies  any  set  of  dependencies.  In  particular,  every  single  relation  can  be  made  into  a  weak 
instance  for  itself  by  adding  a  finite  number  of  tuples  to  it  [MMS.BV1]. 

A  second,  related,  objection  is  that  this  proposal  does  not  seem  to  capture  the  intuitive  seman¬ 
tics  of  tuple-generating  dependencies  in  multi-relation  databases  either.  Consider  for  example  the 
following  database. 

Example  1: 


and  dependencies  {SH-*R ,  RH-*C,  C~++S\RH}. 

The  multivalued  dependency  C  -*-*S  |  RH  is  intended  to  express  the  fact  that  a  student  is 
associated  with  every  ( r,h )  pair  such  that  some  course  that  the  student  takes  meets  at  room  r  at 
time  h.  This  constraint  is  intuitively  violated  in  the  example,  since  Jack  is  not  associated  with  room 
B215  on  Wednesdays  at  10  although  he  is  taking  CS378. 

An  alternative  approach  to  defining  satisfaction  is  through  the  notion  of  complete  states,  intro¬ 
duced  for  a  different  puipose  in  (M].  We  say  that  a  state  is  complete  if  it  contains  each  tuple  that 
appears  in  the  projections  of  every  weak  instance  for  the  state.  The  state  in  Example  1  is  not  com¬ 
plete,  because  every  weak  instance  for  it  contains  the  sub-tuple  <Jaek,B7\\WW>,  which  does  not 
appear  in  R  y  When  only  tuple-generating  dependencies  arc  given,  completeness  coincides  with  the 
standard  notion  of  satisfaction  on  single-relation  databases. 

An  objection  to  the  completeness  criterion  is  that  it  seems  unnatural  for  equality-generating 
dependencies  such  as  functional  dependencies.  Consider  the  following  example 

Example  2: 


Student 

Course 

Jack 

CS378 
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Room 
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CS378 

B2I5 
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Student 

Room  Hour 

John 

B320  F12 

with  die  only  dependency  C-*RH. 

This  is  not  a  complete  state,  since  the  sub-tuple  <Jack,B 215.il/ 10>  will  be  forced  by  C-+RH 
to  appear  in  eveiy  weak  instance,  but  it  does  not  appear  in  R3.  However,  it  is  hard  to  argue  that 
this  state  violates  the  C  -*RH  dependency,  which  simply  requires  that  each  course  be  associated 
with  a  unique  room  and  time. 

In  sum,  we  have  described  two  notions  of  dependency  satisfaction,  consistency  and  complete¬ 
ness.  Consistency  is  die  natural  generalization  of  weak  satisfaction  and  seems  appropriate  when  only 
equality-generating  dependencies  are  given,  but  disagrees  with  the  standard  notion  in  the  presence 
of  tuple-generating  dependencies.  Completeness  is  based  on  the  intuitive  semantics  of  tuple¬ 
generating  dependencies  but  appears  unnatural  for  equality-generating  dependencies.  It  is  our 
thesis  that  neither  approach  is  the  correct  one,  but  rather  that  they  correspond  to  different  policies 
on  constraint  enforcement,  and  each  one  is  appropriate  in  different  circumstances. 

After  introducing  definitions  and  notation  in  Section  2,  Section  3  presents  the  notions  of  con¬ 
sistency  and  completeness.  We  show  how  to  construct  for  a  database  state  p  two  sets  of  first  order 
sentences,  Cp  and  Kp,  such  that  p  is  consistent  with  the  given  dependencies  if  and  only  if  Cp  is 
finitely  satisfiable  and  p  is  complete  with  respect  to  the  given  dependencies  if  and  only  if  Kp  is 
finitely  satisfiable.  In  Section  4,  we  characterize  consistency  and  completeness  in  terms  of  the  chase 
of  the  associated  tableaux  [MJ.  When  all  dependencies  arc  total,  our  results  provide  a  decision  pro¬ 
cedure  for  testing  consistency  and  completeness  of  a  state.  However,  we  show  that  testing  whether  a 
state  is  inconsistent  with  a  typed  equality-generating  dependency  and  testing  whether  a  state  is 
incomplete  with  respect  to  a  join  dependency  are  both  NP-complcte  problems.  Furthermore,  the 
general  problems  of  testing  consistency  and  completeness  under  full  dependencies  arc  shown  to  be 
complete  in  exponential  time.  In  Section  5,  we  study  the  decision  problem  for  consistency  and 
completeness  when  embedded  dependencies  arc  present  By  relating  consistency  and  completeness 
to  the  well-studied  question  of  dependency  implication,  we  show  that  both  arc  undccidablc  in  the 
general  case.  Finally,  in  Section  6  we  examine  the  construction  of  sets  of  sentences  similar  to  Cp 
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and  Kp,  but  without  using  a  predicate  that  stands  for  the  universal  relation  scheme.  We  show  that 
this  can  be  done  when  the  database  scheme  is  weakly  cover-embedding,  a  necessary  condition  for 
independence  of  the  database  scheme  [GYj. 

Our  results  deal  with  untyped  relations  and  dependencies,  that  is,  a  value  may  appear  in 
different  columns  of  a  relation.  However,  all  of  the  results,  except  for  Theorems  8,  9  and  15,  can 
be  specialized  to  the  typed  case. 

1  Definitions  and  Notation 

2.1.  Relations,  database  states  and  tableaux 

We  fix  a  finite  set  of  attributes  called  the  universe,  U  =  {A^ . A„}.  Each  attribute  A,  has 

an  associated  infinite  set  called  its  domain  and  denoted  dom(/f,).  Since  we  deal  with  untyped  data¬ 
bases,  we  shall  let  all  the  domains  be  the  same,  say  the  integers.  A  relation  scheme  R  is  a  subset  of 

U.  A  database  scheme  R  =  {/?i . Rk]  is  a  collection  of  relation  schemes  such  that  the  union 

of  the  Rj’ s  is  U.  A  tuple  defined  on  relation  scheme  R  is  a  function  that  maps  each  attribute  in  R 
to  a  value.  The  value  can  be  either  an  integer  or  a  variable  taken  from  an  infinite  set  of  uninter¬ 
preted  symbols.  A  tuple  can  be  visualized  as  a  row  of  table  where  the  columns  are  labeled  by  the 
attributes.  A  tableau  on  R  is  a  finite  set  of  tuples  defined  on  R.  If  /  is  a  tuple  on  R  and  A'  is  a 
subset  of  R ,  /[Y]  denotes  the  restriction  of  t  to  X.  If  t(A)  is  an  integer  for  every  A  in  X,  we  say 
that  /  is  total  on  X.  A  relation  on  R  is  a  tableau  on  R  such  that  every  tuple  is  total  on  R .  A  rela¬ 
tion  on  U  is  called  a  universal  relation. 

For  r  a  tableau  on  R  and  X  a  subset  of  R  ,the  projection  of  r  on  X  is 
irx(r)  =  { /[Y]  |  t€r  and  t  is  total  on  X  }. 

Note  that  our  definition  of  projection  corresponds  to  what  is  sometimes  called  “total  projection”  in 
die  literature,  so  the  projection  of  any  tableau  on  any  set  of  attributes  is  always  a  relation.  When 
R  =  {  Ri,  we  write  wR(r)  for  <irRl(r), ....  rr*t(r)>. 

A  state  of  a  database  scheme  R  is  a  function  p  that  maps  every  relation  scheme  R  in  R  to  a 
relation  on  R .  Wc  write 


P  =  <r\,  ■  •  •  ,r*>  =  <p(R !>,  •  ■  •  ,p(Rk)> 


We  associate  with  each  state  p  of  R  a  tableau  Tp  defined  on  the  universe.  Tp  contains 
exactly  one  tuple  for  each  tuple  in  each  relation  of  p.  The  tuple  r  corresponding  to  tuple  l  in 
p(R)  is  constructed  by  letting  r[A]=/[A]  and  filling  the  rest  of  r  with  distinct  variables  that  appear 
nowhere  else  in  Tp. 

Example  3:  Let  R ={AB,BCD,AD},  and  p  is  the  state  below. 


p(AB) 

A  B 

1  2 
1  3 


p(BCD) 

BCD 

2  5  8 

4  6  7 


p(AD) 
A  D 
l  9 


Then  Tp  is: 


A 

B 

C 

D 

l 

2 

h 

h 

/ 

3 

bj 

b4 

bs 

2 

5 

8 

b6 

4 

6 

7 

1 

bi 

bi 

9 

A  valuation  v  for  a  tableau  R  is  a  mapping  from  the  symbols  in  the  tableau  into  variables 
and  constants  such  that  v(c)=c  for  every  constant  c  that  appears  in  T. 

11  Dependencies 

Following  {BV1J,  we  use  tableaux  to  represent  implicational  dependencies.  A  template  depen¬ 
dency  (td)  is  a  pair  d  =  <T,w>,  where  T  is  a  tr  ’eau  contain  j  no  constants  and  w  is  a  tuple  con* 
raining  no  constants.  We  say  d  is  fitll  or  total  it  {appears  in  T  for  every  attribute  A.  Other¬ 
wise,  d  is  said  to  be  embedded  or  partial.  A  relation  /  satisfies  d  if  for  every  valuation  v  such  that 
v(r)£/,  there  exists  an  extension  v'  of  v  to  all  the  symbols  of  w  such  that  v'(w)Cl.  Informally,  a 


template  dependency  says  that  if  certain  tuples  appear  in  I,  then  some  other  tuple  must  also 
appear.  Template  dependencies  arc  a  special  casc-of  the  tuple- generating  dependencies  ( tgd's ),  where 
a  set  of  tuples  is  allowed  instead  of  the  single  tuple  w.  For  total  dependencies,  one  can  assume 
without  loss  of  generality  that  a  single  topic  appears  on  the  right  hand  side  [BV1];  thus  total  tem¬ 
plate  dependencies  are  no  less  general  than  total  tuple-generating  dependencies.  Join  dependencies 
[ABU,Ri]  arc  a  specia  case  of  total  id’s.  An  equality-generating  dependency  (egd)  is  a  pair 
d  =  where  7  is  a  tableau  containing  no  constants,  and  a\,ai  are  variables  that  appear 

in  7  for  some  A.  A  tableau  S  satisfies  an  egd  d  =  <7,(ai,a2)>  if  for  every  valuation  v  such  that 
v(T)CS,  v(a\)  =  v(a2).  Functional  dependencies  are  a  special  case  of  egd’s. 

Egd’s  also  act  like  tgd’s,  since  by  generating  new  equalities  they  generate  new  tuples.  This 
action  can  be  simulated  by  total  td’s.  Bccri  and  Vardi  [BV1,  BV2J  show  how  to  construct,  given  a 
set  D  of  dependencies,  a  set  D  of  tgd's  that  has  the  following  properties: 

(1)  D  is  obtained  from  D  by  replacing  each  egd  by  some  td’s. 

(2)  D|=D. 

(3)  Let  d  be  a  tgd.  If  D  fcd  then  D  )=d. 

We  call  D  the  egd- free  version  of  D . 

3.  Consistency  and  Completeness 

In  this  section  we  define  two  properties  of  a  database  state  with  respect  to  a  set  of  dependen¬ 
cies,  consistency  and  completeness,  which  we  consider  to  be  two  different  aspects  of  dependency 
satisfaction.  We  characterize  these  properties  in  terms  of  the  satisfiability  of  two  first-order  theories 
associated  with  the  state. 

Let  us  fix  a  database  scheme  R={/?i . Rn}.  Let  WEAK(D,p)  be  the  set  of  all  weak 

instances  for  a  database  state  p  under  a  set  of  dependencies  D.  That  is,  weak (l),p)  is  the  set  of  all 
universal  relations  satisfying  D  such  that  their  projections  contain  each  relation  in  p.  Say  a  state  p 
is  consistent  with  set  of  dependencies  D  if  weak (D,p)*0.  The  completion  of  a  state  p,  p+,  is 

defined  by 


P+  =  H  (irR(/)}, 

/CWi’AKtO.p) 


where  the  intersection  is  taken  relation-wise.  Note  that  pCp*  for  any  p.  Say  a  state  p  is  complete 
with  respect  to  set  of  dependencies  D  if  p  equals  its  completion,  that  is,  p  =  p+. 


Intuitively,  a  state  p  is  consistent  if  there  is  some  way  of  adding  tuples  to  relations  of  p  that 
will  transform  it  into  the  set  of  projections  of  some  satisfying  universal  instance.  If  p  is  consistent, 
then  there  arc  many  different  sets  of  tuples  which  can  be  added  to  it  to  demonstrate  its  consistency. 
However,  there  arc  certain  tuples  which  will  have  to  appear  in  every  such  extension  of  p.  If  all 
these  necessary  tuples  are  already  in  p  to  begin  with,  then  we  say  p  is  complete.  Note  that  the 
definition  of  completeness  is  based  on  die  egd-free  version  of  D,  D.  This  is  done  to  allow  con¬ 
sistency  and  completeness  to  be  independent  notions.  While  weak (Z),p)  could  be  empty  (for 
inconsistent  p),  WEAK (D,p)  is  never  empty.  We  will  show  later  that,  for  consistent  states,  it  does 
not  matter  whether  D  or  D  is  used. 

Several  workers  [GM,Ni]  have  advocated  the  use  of  first  order  logic  to  express  dependencies 
and  other  constraints.  As  we  explained  in  the  introduction,  their  approach  docs  not  easily  generalize 
to  dependencies  in  multi-relation  databases.  Given  a  dependency  statement  such  as  X~*Y  in  a 
database  scheme  where  X  and  Y  may  not  appear  together  in  one  relation  scheme,  or  may  appear 
in  more  than  one,  it  is  not  clear  how  satisfaction  of  the  dependency  can  be  formalized  as  satisfac¬ 
tion  of  some  first  order  sentence. 

The  notions  of  satisfaction  proposed  above  do  provide  a  means  of  using  first  order  logic  to 
formalize  dependencies.  However,  a  rather  drastic  shift  in  point  of  view  is  required.  It  is  no  longer 
possible  to  write  down  a  sentence  for  each  dependency  and  ask  whether  the  database  provides  a 
model  for  each  of  these  sentences.  Consider  for  example  the  notion  of  consistency  as  satisfaction, 
and  let  d\  =  A-*C,  d2  =  B~*C ,  with  the  database  scheme  {AB,BC}.  Let  p(AB)  =  <00,01>  and 
p(BC)  =  <01,12>.  It  is  easy  to  see  that  p  is  consistent  with  d\  and  with  d2,  but  it  is  not  consistent 
with  {di,d2}. 

Our  approach  is  to  construct  two  sets  of  sentences,  Cp  and  Kp,  for  each  state  p.  We  will  show 
that  p  is  consistent  exactly  when  Cp  is  finitely  satisfiablc  and  that  p  is  complete  exactly  when  Kp  is 
finitely  satisfiable.  Thus  we  reduce  both  notions  of  satisfaction  to  the  standard  logical  notion  of 
finite  satisfiability.  Note  that  in  this  approach  consistency  and  completeness  of  a  state  are  not  first 
order  notions;  they  are  statements  about  first  order  theories  rather  than  statements  in  these  theories. 

Before  constructing  the  sets  of  sentences  Cp  and  Kp,  we  fix  a  linear  ordering  on  the  elements 

of  the  universe  U.  We  now  write  U  as  the  sequence  <A\ . A„>.  Each  relation  scheme  R  in  R 

may  be  written  as  the  sequence  <AI{ . Aim>,  where  ij<ik  for  j<k. 

Cp  and  Kp  each  contain  two  uibscts  of  sentences,  the  database  scheme  axioms  and  the  state 
axioms.  The  scheme  axioms  depend  only  on  the  database  scheme  R  and  tire  set  of  dependencies  D ; 
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the  state  axioms  depend  on  the  state  p. 

The  scheme  axioms  of  both  Cp  and  Kp  include  the  containing  instance  axioms.  For  each 
relation  scheme  R,  there  is  one  containing  instance  axiom  that  says  that  every  tuple  in  the  R- 
relation  of  the  state  must  be  the  projection  on  R  of  some  tuple  of  the  universal  relation.  In  other 
words,  the  containing  instance  axioms  assert  the  existence  of  a  containing  instance  for  the  state. 
Formally,  for  each  relation  scheme  R  =  <Ait . Aim>  in  R,  there  is  a  sentence  of  the  form 

V  a 3  y(R(a\ . a„)-t’U(y0,ai,yl,a2,...,am,ym)) 

where  a  is  the  sequence  a^a^  . . . .  am  and  y  is  the  sequence  <y<j, . . .  ,ym>,  no  oy  appears  in  y  and 
no  symbol  appears  more  than  once  in  y.  The  sequence  yj  is  of  length  i/+i-(//  +  l),  where  i'o=0 
and  im+i=n  +  l. 

The  scheme  axioms  of  Cp  also  include  the  dependency  axioms,  which  are  just  the  dependen¬ 
cies  in  D  encoded  as  implicational  sentences  as  described  by  Fagin  [F].  Kp  also  contains  depen¬ 
dency  axioms,  but  in  this  case  we  use  the  egd-free  version  of  D,  D,  rather  than  D  itself. 

The  state  axioms  of  both  Cp  and  Kp  contain  the  state  p  encoded  as  a  set  of  quantifier-free 
sentences.  For  each  tuple  . . .  ,am>  in  p(R),  we  include  the  sentence  /J(alt . . .  ,am ),  where 
the  at' s  are  constants. 

Finally,  the  state  axioms  of  Cp  include  the  distinctness  axioms,  and  those  of  Kp  include  the 
completeness  axioms.  Hie  distinctness  axioms  arc  the  set  of  inequalities  c*d,  where  c  and  d  are 

distinct  constants  appearing  in  p.  The  completeness  axioms  contain,  for  each  tuple  <a\ . am> 

such  that  each  at  appears  in  p  but  the  tuple  itself  docs  not  rppear  in  p(R ),  the  sentence 

Yy(-i{/(yo,alt - am,y„)) 

where  y  is  constructed  as  in  the  containing  instance  axioms.  Intuitively,  the  completeness  axioms 
say  that  only  tuples  appearing  in  p(R )  can  be  in  the  projection  of  the  universal  relation  on  R. 

Example  4:  We  construct  Cp  and  Kp  for  the  state  shown  in  Example  1.  U  is  the  sequence 
<S,C,R,H>;  R  contains  the  schemes  Ri  =  <S,C>,  Ri  =  <C,R,H>,  and  R)  =  <S,R,H>.  The 
dependencies  are  the  functional  dependencies  SH-*R,  RH-*C ,  and  the  multivalued  dependency 
C-+S\RH. 
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•  Containing  Instance  Axioms: 


Y  s,c  9  r,A  U(s,c,r,h )) 

Y  c,r,h  9  s(R^c,r,h}~*U(s,c,r.h )) 

Y  s,r,h  9  c  (Ri(s,r,h)-*U(s,c,r,h)) 


•  Dependencies: 


(Y  J|CiC2/rirjr2)(f/(il,Ci.r1,/i1)Ai/(j1,c2,r2,/ii)-»  r1  =  r2) 

(Y  SiS2CiC2hiri)(U(si,cx,ri,h\)KU(s2,C2'rbhi)~*  ^1=^2) 

(Y  V2Cir1r2A1A2Xl/(si,Ci,ri,/ii)Al/(s2,c1,r2,/i2)-»  l/(j2,Ci,rifAj)) 


■  Egd-free  Dependency  Axioms: 


(Y  S1S2C iC2cjA iA2rir2X  1/ (si.c l.ri.A^i/ ( Ji,c2,r2,A  i)a£/(s2,c  j,rj,A2) 

t/(s2,Cj,r2,A2)) 

(Y  iiS2sjCic2A1A2rlr2)(t/(s1,ci,ri,Ai)Af/(s2,c2,ri,Ai)Ai7(sj,c1,r2,A2) 

“*■  UishCiJbhd) 

(Y  SiJ2Cirir2A1A2Xi/(si,ci,/-1,Ai)Aj/(s2,ci,r2.A2)-*  i/(s2,ci,ri,Ax)) 


•  State  axioms: 


*i(A«r*,CS378) 


J?2(C5  378,5215,  M 10) 
MCS378.JB21W10) 


/?  j(/ac  A  ,/?215,A/ 10) 


'■  ;•  .■  v.* ji’ vv  .-"v  *.* 


TO? 


•  Distinctness  axioms: 


5215*5213 
A/ 10*  #10 
Jack*CSm 
Jack*  BUS 
etc. 

•  Completeness  axioms: 

For  Ri’. 

Y  r,h  ->U(CSm,CSm,r,h) 

V  r,h  -*U(B2\S,CSm,r,h) 

etc. 


For  R2 : 

V  J  -i{/(j,CS378,5213,W10) 
Ys-i(/(s,CS  378,6215, 1F10) 
etc. 


For  5j*. 


Y  c~>U(Jack, c, 6  213,^10) 


etc. 

Cp  consists  of  the  containing  instance  axioms,  the  dependency  axioms,  the  state  axioms  and 
die  distinctness  axioms.  Kp  consists  of  the  containing  instance  axioms,  the  cgd-frcc  dependency 
axioms,  the  state  axioms  and  the  completeness  axioms. 

Before  proceeding  to  our  results,  we  introduce  some  basic  definitions  and  notation  of  model 
theory.  A  structure  for  a  language  /,  consists  of  a  domain  of  elements  and  an  interpretation  of  each 
predicate  and  constant  of  /..  A  structure  with  a  finite  domain  is  finite,  if  M  is  a  structure  with 
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domain  A  and  P  is  a  k-ary  predicate  symbol,  then  M(/*) C  A*  is  the  interpretation  of  P  in  M, 
and  if  c  is  a  constant,  M(c)€  A  is  the  interpretation  of  c  in  M. 

A  structure  M  for  a  language  L  is  a  model  of  a  set  of  sentences  X  if  for  every  o  €  £,  <r  is 
true  in  M,  written  A/  {=  o  (we  assume  the  reader  to  be  familiar  with  the  notion  of  truth  in  a 
model).  A  set  of  sentences  is  finitely  satisfiable  if  it  has  a  finite  model. 

The  next  two  theorems  establish  that  finite  satisfiability  of  Cp  and  Kp  arc  identical  respec¬ 
tively  to  consistency  and  completeness  of  p  with  respect  to  D. 

Theorem  1:  Cp  is  finitely  satisfiable  if  and  only  if  p  is  consistent  with  D. 

Proof:  If  p  is  consistent,  it  is  clear  that  for  every  /  €  wlak( Z),p)  the  structure  M  with 
M (R )  =  p(R )  and  M(U)  =  /  is  a  finite  model  of  Cp.  For  the  converse,  let  M  be  a  finite  model  of 
Cp.  For  each  pair  of  constants  c,  d  in  the  language  of  Cp,  wc  have  M  |=  c*d,  so  M(c)*M(d). 
Thus  wc  can  assume  without  loss  of  generality  that  constants  are  interpreted  as  themselves,  i.e. 
M(c)  =  c.  It  is  easy  to  verify  that  M (U)  is  a  containing  instance  for  p  that  satisfies  D.  That  is, 
M((/)€weak(0,p),  so  p  is  consistent  with  D.  □ 

Theorem  2:  Kp  is  finitely  satisfiable  if  and  only  if  p  is  complete  with  respect  to  D. 

Proof:  Suppose  Kp  is  satisfiable  with  a  finite  model  M.  Wc  claim  that  Kp  has  a  finite  model 
M'  where  no  two  constants  have  the  same  interpretation.  To  construct  M’,  wc  replace  each  ele¬ 
ment  of  the  domain  of  M  with  many  distinct  copies  of  that  clement.  For  example,  if 
<a,6,c>€  M(R),  then  M'(/?)  would  contain  tuples  <ai,bi,ci>,  <ai,bi,C2>,  <oi,b2,cj>,  etc.  If  two 
constants  arc  interpreted  as  the  same  clement  in  M,  they  would  be  interpreted  as  two  distinct  copies 
of  that  element  in  M\  Since  Kp  docs  not  have  equality  in  it,  M'  is  still  a  model  of  Kp.  Thus, 
without  loss  of  gcncraity,  wc  can  assume  that  in  M'  constants  are  interpreted  as  themselves,  so 
M'(f/)€WEAK(/),p).  By  the  completeness  axioms,  wR(f)  docs  not  contain  any  tuple  constructed 
from  values  appearing  in  p  but  not  itself  appearing  in  p.  Hence,  p+  cannot  contain  any  such  tuple, 
so  p  is  complete. 

For  the  converse,  suppose  that  p  is  complete.  Consider  the  set  S  of  all  tuples  /  on  some  rela¬ 
tion  scheme  R  such  that  l  is  constructed  from  values  appearing  in  p  but  t  is  not  in  p.  Since  p  is 
complete,  if  t  is  an  /{-tuple  in  S.  then  there  is  a  universal  relation  /,  €  wi  AK(O.p)  such  that 
/€**(/,).  Wc  use  now  the  direct  product  construction  [(•]  to  produce  a  universal  relation 
/  €  WKAK(/2,p)  such  that  if  I  is  an  /{-tuple  from  .V,  then  t  £  tt«(/).  I  is  the  direct  product  X /,. 

constructed  as  follows.  Let  S  =  {t\ . The  values  in  I  arc  »» -sequences  c  -  <cj . cm> 


» 


•  •» 


of  constants  from  p,  where  wc  identify  the  m -sequence  <c,c.  •  •  •  ,c>  with  the  constant  c.  Now  a 
tuple  s  is  in  /  if  and  only  if  the  tuple  s„  which  is  obtained  from  s  by  projecting  each  w -sequence 
on  its  /'-th  component,  is  in  7(j.  It  is  straightforward  to  verify  that  /  is  a  containing  instance  for  p. 
Furtltermore,  since  dependencies  are  preserved  under  direct  product  [FJ,  I  must  satisfy  D.  The 
finite  structure  M  with  M(R)  =  p(R)  for  each  R  and  M(t/)  =  /  is  a  model  of  Kp.  □ 

4.  Testing  Satisfaction  under  Full  Dependencies 

In  this  section  we  show  that  both  consistency  and  completeness  of  a  state  can  be  tested  by 
chasing  the  associated  tableau,  when  embedded  dependencies  are  not  present.  Thus  the  upper 
bounds  on  complexity  known  for  chasing  tableaux  under  various  special  kinds  of  full  dependencies 
apply  also  to  testing  satisfaction.  Throughout  this  section,  D  is  a  set  of  full  dependencies.  We  start 
by  defining  the  chase  of  a  tableau  under  full  dependencies  and  establishing  preliminary  results 
about  chasing  with  the  egd-free  version  of  D. 

The  chase  of  a  tableau  T  with  respect  to  a  set  of  dependencies  D,  denoted  by  CHASEo  (7*),  is 
the  result  of  applying  the  following  two  transformation  rules  exhaustively  to  T. 

Td-rule: 

If  <S,w>  is  a  td  in  D,  and  there  exists  a  valuation  v  such  that  v(i')C7\  add  v(w)  to  the  rows 
of  T. 

Egd-rule: 

Suppose  <S,(ai,a2»  is  an  egd  in  D,  and  there  exists  a  valuation  v  such  that  v(S)CT,  and 
vCaO.vfaj)  arc  not  both  constants.  If  only  one  of  v(ai),v(a2)  is  a  constant,  rename  all 
occurrences  of  the  other  one  in  T  to  that  constant  If  both  are  variables,  rename  all 
occurrences  of  the  higher  numbered  variable  to  the  lower  numbered  one. 


Given  a  tableau  Tp  and  set  of  dependencies  D,  wc  will  write 

Tp  =  cilASi-oITp) 


rp+  -  CHASI ijjiTp) 


where  f)  is  the  cgd-frcc  version  of  D . 


Wc  need  some  preliminary  results  before  characterizing  consistent  suites. 


vw>,>w 


Leman  1:  For  each  /€weak(Dj>),  there  is  a  valuation  v  such  that  v(T‘p)CI. 

Proof:  By  induction  on  the  computation  of  T*f.  □ 

2:  For  “» "**“  **  *=  *0 

Proof:  Let  7€weak(/)^>).  Let  v  be  the  valuation  of  the  previous  lemma,  v(Tp  )QI.  Then 


*a(7,;)g*t(v(r;))c*a(/x 


It  follows  that  w^Tt)  £  ,  n  {**(/)}.  For  the  other  inclusion,  let  f  be  a  tuple  in  the  Rt- 

F  /CWBAB (0*) 

component  of  O  {««(/)}.  Let  v  be  an  injective  valuation  for  T\  that  maps  each  variable 
/CweakU)^)  p 

to  a  constant  not  appearing  in  t.  Since  v(7^)€wEAK(D,p),  r€*Xf(v(Tp».  By  construction  of  v, 
this  implies  that  i  must  come  from  some  ^ -total  tuple  of  T*p,  so  iZw^iTp).  □ 

Consistency  and  completeness  of  a  state  can  be  characterized  in  terms  of  the  associated 
tableau  Tp  as  follows. 

Theorem  3:  The  following  are  equivalent 

(a)  p  is  consistent  with  D. 

(b)  Tp  satisfies/). 


(a)  implies  (b):  I  jet  /  €weak(Z)  ,p),  and  let  v  be  the  valuation  of  Lemma  1.  Suppose  that  Tp  does 
not  satisfy  some  </€/).  d  cannot  be  a  td,  since  in  that  case  a  td-rule  is  applicable  to  Tp.  Thus  d 
must  be  an  egd  <S,{a\,a7)>,  and  there  is  a  valuation  v'  such  that  v\S)CTp  and  v'(ai)*v'(aj)- 
Both  v'(a\)  and  v\aj)  must  be  constants,  otherwise  an  egd-rule  is  applicable  to  Tp.  But  now 
v(v\S))Ql  and  v(v'{ai))*v(v'(fl2))<  so  I  docs  not  satisfy  D  -  a  contradiction. 

(b)  implies  (a):  Let  v  be  an  injective  valuation  for  Tp  that  map  each  variable  to  a  constant  not 
appearing  in  p.  Then  v(Tp)  satisfies  D  and  it  is  a  containing  instance  for  p.  lhus 
WEAK (/>,p)*0.  □ 

We  need  two  more  preliminary  results  before  characterizing  complete  states. 

Lemma  3:  For  each  /€wiiAK(/),p),  there  is  a  valuation  v  such  that  v(Tp  )£/. 

Proof:  By  induction  on  the  computation  of  Tp  .  □ 

The  next  lemma  shows  that  the  completion  of  a  state  can  be  obtained  from  Tp  . 
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La—  4:  For  any  state  p,  p+  =  *a (Tf ). 

Proof:  Let  /€WEAtc(Z),p).  Let  v  be  the  valuation  of  the  previous  lemma,  v(Tp  )C/.  Then 

*n(7,(r)C*a(v(7'p+))C*t(n 

It  follows  that  w«(rp+ )  C  p+.  For  the  other  inclusion,  let  t  be  a  tuple  in  the  ^/-component  of  p+. 
Let  v  be  an  injective  valuation  for  Tp  that  maps  each  variable  to  a  constant  not  appearing  in  t. 
Since  v(Tf  )€weak(D,p),  f€rg/v(  Tf).  By  construction  of  v,  this  implies  that  t  must  come 
from  some  A, -total  tuple  of  T£ ,  so  itugJLTf ).  □ 

Theorem  4:  The  following  are  equivalent 

(a)  p  is  complete  with  respect  to  D. 

(b)  p  is  complete  with  respect  to  D. 

(c) p  =  *n(r;). 

Proof:  The  equivalence  of  (a)  and  (b)  is  immediate  from  the  fact  that  D-D.  The 
equivalence  of  (a)  and  (c)  follows  directly  from  Lemma  4.  □ 

We  have  defined  completeness  and  consistency  to  be  independent  notions.  However,  it  is 
interesting  to  note  that  for  consistent  states,  the  notion  of  completeness  can  be  simplified  as  fol¬ 
lows. 

Theorem  5:  For  state  p  consistent  with  dependencies  D.  the  following  arc  equivalent: 

(a)  p  is  complete  with  respect  to  D. 

(b) p  =  m^rp). 

Proof: 

(b)  equivalent  to  (c):  Follows  from  Lemma  2. 

(a)  equivalent  to  (b):  By  Theorem  2,  p  is  complete  wit  D  iff  if  p  -  wK (Tp ).  Wc  claim  that 
*h(T;)  =  w«(77  )  for  consistent  states.  Since  p  is  consistent,  by  Theorem  1,  Tp  satisfies  /).  By 
property  (2)  of  D,  wc  also  have  that  Tp  satisfies  D.  Hence,  by  1  einma  3.  there  is  a  valuation  v 
such  that  v(Tp)  CTp.  Consequently,  v^(Tp  )C  ).  For  the  other  inclusion,  let  /  be  a 
tuple  in  die  A,- -component  of  wr(7'p).  Let  v  be  a  map  for  Tp  that  sends  distinct  comtanls  to  dis 
tinct  variables.  Let  /'=  t>(7’p),  and  let  s  be  a  tuple  such  that  .v[A,  j-  »•(/)  and  the  test  of  i  consists 
of  distinct  new  variables.  Wc  claim  that  D  </',.*>.  Indeed,  there  is  a  tuple  t\  in  Tp  such  that 
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*iUM=L  Therefore,  there  is  a  tuple  sj  in  C11ASI-d(7)  such  that  By  the  results  in 

[BV1J,  it  follows  that  D  \=<T,s>.  But  then  also  D  |= <T,s> ,  by  property  (3)  of  D.  Thus,  by  the 
results  in  [BV1],  there  is  a  tuple  S2  in  CHASER?)  such  that  S][£j]=s[Ri].  It  follows  that 
),  a 


Corollary  l:  The  following  are  equivalent: 

(a)  p  is  consistent  and  complete  with  respect  to  D. 

(b)  Tp  satisfies  D  and  p  =  wR(7p). 

(c) p=  fl  {»„(/)}.  □ 

/€weak(/)^>) 


The  next  theorem  relates  consistency  and  completeness  to  standard  satisfaction  for  single  rela¬ 
tions. 


Theorem  6:  For  R  =  {U},  p(U)  satisfies  D  if  and  only  if  p  is  consistent  and  complete  with 
respect  to  D. 

Proof:  (Only  if)  Let  J  =  p(U)  6  SatCD).  Since  J  €WEAK(Z),p),  p  is  consistent  with  D. 
We  claim  also  that  p  =  Since  p  =  **(./),  clearly  /€wiQ iD  p)MD}Qp. 

Furthermore,  for  every  /€  WEAK(/),p),  JCI,  that  is,  p  C  »«(/),  so  pC  O  (irR(/)}. 

/€WFAK(0,p) 

Therefore  p  is  complete  wrt  D. 

(10  Let  p  be  consistent  and  complete  with  respect  to  D.  Then  T*p  satisfies  D  by  Theorem  3, 
and  p  =  wr(T’p)  by  Theorem  5.  Since  R  =  {£/}  and  all  dependencies  are  total,  clearly 
vg(Tp)  =  Tp,  hence  p  =  T*p  satisfies  D.  □ 

As  a  consequence  of  Theorems  3  and  4,  the  chase  is  a  decision  procedure  for  consistency  and 
completeness  under  full  dependencies.  In  the  rest  of  this  section  we  shall  give  upper  and  lower 
complexity  bounds  for  these  problems. 

We  first  give  NP-completencss  results  that  follow  from  Theorem  6. 

Theorem  7: 


(1)  For  R=  {(/},  testing  whether  a  state  p  is  inconsistent  with  a  typed  egd  or  whether  it  is  incom¬ 
plete  with  respect  to  a  jd  is  NP-complcte. 

(2)  For  R= {(/},  testing  whether  a  state  p  is  not  complete  with  respect  to  a  set  D  of  full  depen¬ 
dencies  is  NP-compIctc. 


(3)  Testing  whether  a  state  p  is  inconsistent  with  a  set  D  of  cgd’s  is  NP-complctc. 

Proof: 

(1)  In  [MSY]  it  is  shown  that  testing  whether  a  relation  violates  a  jd  is  NP-complete,  and  in 
[BV3]  it  is  shown  that  testing  whether  a  relation  violates  a  typed  egd  is  NP-complete.  The 
claim  now  follows  by  Theorem  6. 

(2)  The  claim  follows  by  the  above  mentioned  NP-complctencss  results  in  [BV3,  MSY]  and 
'  Theorem  6. 

(3)  NP-hardness  follows  from  the  first  claim;  we  have  to  show  that  the  problem  is  in  NP.  To  test 
for  inconsistency,  one  constructs  Tp  and  chases  it  by  D.  If  at  any  stage  the  chase  require 
identifying  two  constants,  then  p  is  inconsistent  with  D.  By  [BV3]  chasing  by  cgd’s  can  be 
done  in  nondeterminisdc  polynomial  time.  □ 

We  now  refer  to  the  general  case  and  prove  lower  and  upper  exponential  time  bounds. 

Theorem  8:  Testing  whether  a  state  p  is  consistent  with  a  set  D  of  full  dependencies  is 
EXPTIME-complete. 

Proof:  As  observed  before,  to  test  for  inconsistency,  one  constructs  Tp  and  chases  it  by  D . 
If  at  any  stage  the  chase  require  identifying  two  constants,  then  p  is  inconsistent  with  D .  Other¬ 
wise,  it  is  consistent  An  analysis  of  the  chase  in  [BV3]  shows  that  it  can  be  done  in  exponential 
time.  It  remains  to  show  that  the  problem  is  EXPTIME-hard.  We  show  it  by  reduction  from  the 
implication  problem  for  full  td’s,  which  was  shown  in  [CLM]  to  be  EXPTIME-complete.  That  is, 
given  a  set  D  of  full  td’s  and  a  full  td  d,  we  construct  in  polynomial  time  a  set  O'  of  full  depen¬ 
dencies  and  a  state  p  such  that  D  \=d  iff  p  is  inconsistent  with  D'. 

Let  U  be  the  relation  scheme  for  the  dependencies  DUd.  Let  d  be  <T,w>,  with 
7’={wi> . . . ,  wm}.  Without  loss  of  generality  assume  that  there  arc  at  least  two  variables  in  T. 
To  test  whether  D  \=d,  we  chase  T  by  D  and  sec  whether  w  is  generated.  The  idea  of  the  reduc¬ 
tion  is  to  have  a  state  p  that  “looks  like”  T  and  a  set  O'  of  dependencies  that  simulate  D  and  in 
addition  force  identification  of  two  constants  if  w  is  generated.  In  order  to  do  that  wc  need  to 
mark  the  tuples  in  the  original  state  and  the  tuples  that  arc  generated  by  the  chase.  1'hc  marking  is 
done  by  equalities  satisfied  by  the  tuples.  For  that  we  add  new  attributes:  the  database  scheme  is 
R = {(/'},  where 


U'=U  U{/M, . . Bm }. 
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Let  a  be  a  one-to-one  valuation  that  maps  the  variables  in  T  to  constants.  p(U')  has  tuples 
u\, . . .  ,um  that  correspond  to  the  tuples  wj. . . . ,  wm  in  T  in  the  following  way: 

(1)  «i[t/}=a(w,), 

(2)  Uj[i4]  and  Uj{y</]  are  the  same  new  constant,  and 

(3)  ui  has  distinct  new  constants  elsewhere. 

Note  that,  since  R  has  a  single  relation  scheme,  Tp  is  just  p(lO. 

Let  now  <S,v>  be  a  full  td  in  D.  We  construct  a  full  td  <S',v'>  on  U'  and  put  it  in  D'.  For 
each  tuple  v,  in  S  we  have  a  tuple  v’t  in  S'  defined  as  follows: 

(1)  v',lt/l=v„ 

(2)  v'i  has  distinct  new  variables  elsewhere, 
v'  is  defined  as  follows: 

(1)  v'[t/]=v,  and 

(2)  vlA.Ah  •  •  •  .Am]=vlB,Bi, . . .  ,Bm]=vx[B,Bh  ....  BmY 

For  example,  if  <S,v>  is: 


F 

G 

H 

v: 

f 

g 

h 

v/; 

f 

g 

hi 

V 

f 

gl 

h 

V 

fl 

g 

g 

Then  <S',v'>  is: 


U 


***** 
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(Dot  represents  variables  with  unique  occurrences). 

In  addition  we  put  in  Z)'  an  egd  <T\(aha2)>,  where  a\  and  a2  are  two  distinct  variables  from 
T.  T‘  has  tuples  w{, ....  wm\w'  that  correspond  to  wj, . . . ,  wm,w. 

w't  is  defined  as  follows: 

(1) 

(2)  w',[A)  and  w't[A,]  arc  the  same  new  variable,  and 

(3)  w't  has  new  distinct  variables  elsewhere. 
w'  is  defined  as  follows: 

(1)  w1l/]=w,  and 

(2)  w'  has  new  distinct  variables  elsewhere. 

For  example,  if  <T,w>  is: 


F 
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w: 
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h 
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fl 
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Then  foe  constructed  egd  is  ,/  1)>,  where  T'  is: 


■M 
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Bj  B2  B3  F  G  H 
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f  . 

f 

a3  .  a3 

• 

•  fl 

g 

h 

i: 

•  •  •  •  •  • 

• 

■  f 

g 

h 

(Dot  represents  variables  with  unique  occurrences). 

To  prove  that  D  \=d  iff  p  is  inconsistent  with  D'  we  show  that  a  chase  of  T  by  D  can  be 
simulated  by  a  chase  of  7^  by  D'  and  vice  versa. 

Consider  first  a  chase  of  T  by  D.  We  claim  that  for  any  tuple  t  generated  by  a  td  <S,v>  in  a 
chase  of  T  by  D,  one  can  generate  a  tuple  t'  by  the  td  <S',v'>  in  a  chase  of  Tpby  D'  such  that: 

(1)  /'[{/!= a(r).  and 

(2)  forlorn. 

We  leave  the  verification  of  this  claim  to  the  reader.  If  D\=d,  then  w  is  generated  by  the  chase  of 
T  by  D.  Therefore,  a  chase  of  Tp  by  D'  generates  a  tuple  u  such  that  «[(/]= o(tv).  Let  us  now 
apply  the  egd  <T',(aha2)>  with  a  valuation  p  that  maps  w't  to  u/  and  w'  to  u.  j 3  agrees  with  a  on 
the  variables  of  T,  so  we  arc  forced  to  identify  a(ai)  and  a(a 2).  That  means  that  p  is  inconsistent 
with  D‘. 

Consider  now  a  chase  of  Tp  by  D'.  We  claim  that  for  any  tuple  /'  generated  by  a  td  <S\v’> 
in  a  chase  of  Tp  by  D',  one  can  generate  a  tuple  /  by  the  td  <S,v>  in  a  chase  of  T  by  D  such 

that: 

(1)  /'[!/]=«(/),  and 

(2)  /'MJ*/U1  forlorn. 

We  leave  the  verification  of  this  claim  to  the  reader.  If  p  is  inconsistent  with  D'  then  the  egd 
<r',(a1,a2)>  must  be  applied  with  some  valuation  p.  But  since  w\[A ]=w'/[/4(],  w)  cannot  be 
mapped  by  P  to  any  other  tuple  but  ut.  Thus  P  agrees  with  a  on  the  variables  of  T.  In  particular, 
/Hw'X £/)=<*(*).  That  is,  w'  must  be  mapped  to  a  tuple  u  generated  by  the  chase  of  Tp  such  that 
«[I/]=a(w).  But  then  w  is  generated  by  a  chase  of  T  by  D,  so  D  ^d. 

To  complete  the  proof  we  note  that  the  reduction  from  D  and  d  to  p  and  D'  can  be  done  in 
polynomial  time.  □ 


Corollary  2:  For  R={1/},  testing  whether  a  state  p  is  consistent  with  a  set  D  of  full  depen¬ 
dencies  is  EXPriME-complete.  □ 

The  corollary  should  be  contrasted  with  clause  (2)  of  Theorem  7.  While  consistency  is 
EXPTIME-complete  even  for  database  schemes  with  a  single  relation  scheme,  for  completeness 
there  is  a  complexity  gap  between  the  case  of  database  schemes  with  a  single  relation  scheme  and 
die  case  of  database  schemes  with  two  relation  schemes. 

Theorem  9:  Testing  whether  a  state  p  is  complete  with  respect  to  a  set  Z)  of  full  td’s  is 
EXPTIME-complete. 

Proof:  To  test  for  incompleteness,  one  constructs  Tp  and  chases  it  by  D.  If  at  any  stage  a 
tuple  i  is  generated  such  that  has  no  variables  and  /[#/]  is  not  in  p(/J,)  for  some  of  die  rela¬ 
tion  schcms  Ri  in  the  database  scheme  R,  then  p  is  incomplete  with  respect  D .  Otherwise,  it  is 
complete.  An  analysis  of  the  chase  in  [BV3]  shows  that  it  can  be  done  in  exponential  time.  It 
remains  to  show  that  the  problem  is  EXPTIME-hard.  We  show  it  by  reduction  from  the  implica¬ 
tion  problem  for  full  td’s  which  was  shown  in  [CLM]  to  be  EXPTIME-complete.  That  is,  given  a 
set  D  of  full  td’s  and  a  full  td  d,  we  construct  in  polynomial  time  a  set  D'  of  full  td’s  and  a  state  p 
such  that  Dfcd  iff  p  is  incomplete  with  D'. 

Let  U  be  the  relation  scheme  for  the  dependencies  DUd.  Let  d  be  <T,w>,  with 
T ={wj, ....  wm}.  Without  loss  of  generality  assume  that  w  is  not  in  T.  To  test  whether  D  \=d, 
we  chase  T  by  D  and  see  whether  w  is  generated.  The  idea  of  the  reduction  is  to  have  a  state  p 
that  “looks  like”  T  and  a  set  D'  of  dependencies  that  simulate  D  and  in  addition  generate  a  “for¬ 
bidden"  tuple  if  w  is  generated.  Unlike  the  reduction  in  the  proof  of  Theorem  8,  we  have  to  be 
careful  not  to  generate  “forbidden”  tuples  too  early.  For  that  we  add  new  attributes:  the  database 

scheme  is  R={/?i,/?2},  where  Ri=f/U{>f,/Mi . 4*,}  and  /f2={C,Z)}.  The  new  universe  is 

f/'=R1UR2. 

Let  a  be  a  one-to-one  valuation  that  map  the  variables  in  T  to  constants.  p(R\)  has  tuples 
«i . un  that  correspond  to  the  tuples  w2 . wm  in  T  in  the  following  way: 

(1)  u,[U]=a(w(), 

(2)  Ui[A],  Ui[B J,  and  «,[/!,]  are  the  same  new  constant,  and 

(3)  ut  has  distinct  constants  elsewhere. 

p(/?2)  has  a  single  tuple  uQ  such  that  uJ[C]  and  uu(/)]  arc  the  same  new  constant 
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* 
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In  Tp,  the  tuples  uj, . . . ,  u„  are  extended  with  distinct  new  variables  for  the  attributes  C 
and  D,  and  u0  is  extended  with  unique  variables  for  all  attributes  other  than  C  and  D.  The  idea  is 
that  for  every  tuple  /  generated  by  a  chase  of  Tp,  t[A{[  and  t[D]  arc  variables,  so  no  “forbidden” 
tuple  is  generated  until  the  very  last  step. 

Let  now  <S,v>  be  a  full  td  in  D.  We  construct  a  full  td  <S',v'>  on  U'  and  put  it  in  D'.  For 
each  tuple  v,  in  S  we  have  a  tuple  v',-  in  S'  defined  as  follows: 

(1)  v'i[£/]=vi, 

(2)  v',[y4]  and  v ',  [/?]  are  the  same  new  variable,  and 

(3)  v',  has  distinct  new  variables  elsewhere. 

In  addition  S'  has  a  tuple  v'0  defined  as  follows: 

(1)  v'olC]  and  v'ofZ)]  are  the  same  new  variable,  and 

(2)  v'o  has  distinct  new  variables  elsewhere. 

Finally,  v'  is  defined  as  follows: 

(1)  v'[f/]=v, 

(2)  v'[Ai . Am]-v'0 [Ah....Am], 

(3)  v'M]  and  v'[#]  are  the  same  old  variable  (any  variable  from  v  will  do),  and 

(4)  v'[CD\=v\[CD]. 

For  example,  if  <S,v>  is: 
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Then  <S',v'>  is: 


•V** *  **.  WV*‘  *: 
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(Dot  represents  variables  with  unique  occurrences). 

In  addition  we  put  in  D'  a  full  td  <T',w'>.  T'  has  tuples  w'0 y»\, ....  wm'  that  correspond  to 
w,wh  . . . ,  wm. 
w' o  is  defined  as  follows: 

(1)  w'o(t/]=w,  and 

(2)  w'o  has  new  distinct  variables  elsewhere. 
w'j  is  defined  as  follows: 

(3)  w'Jt/Jsw,. 

(4)  w',[A]  and  w ',[/</]  are  the  same  new  variable,  and 

(5)  w' i  has  new  distinct  variables  elsewhere. 

Finally,  w'  is  defined  as  follows: 

(1)  w'[t/J=*v,  and 

(2)  w'[A ....  ....  Am,C,D\. 

For  example,  if  <T,w>  is: 
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Then  is: 
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(Dot  represents  variables  with  unique  occurrences). 

To  prove  that  D  fcd  iff  p  is  incomplete  with  respect  to  D'  we  show  that  a  chase  of  T  by  D 
can  be  simulated  by  a  chase  of  Tp  by  D'  and  vice  versa. 

Consider  first  a  chase  of  T  by  D.  We  claim  that  for  any  tuple  /  generated  by  a  td  <S,v>  in  a 
chase  of  T  by  D,  one  can  generate  a  tuple  /'  by  the  td  <S',vf>  in  a  chase  of  Tp  by  D'  such  that; 

(1) 

(2)  t\A\ and  t'\B]  are  the  same  constant, 

(3)  ...  .t'XAgX  rlCJ.  and  are  distinct  variable! 

We  leave  the  verification  of  this  claim  to  the  reader.  If  D  \=d,  then  w  is  generated  by  the 
chase  of  T  by  D.  Therefore,  a  chase  of  Tp  by  D'  generates  a  tuple  u  such  that  u[l/]=a(w).  Let 
us  now  apply  the  td  <T‘,w'>  in  the  chase  of  Tp  with  the  valuation  fi  that  maps  w'0  to  u  and  maps 
w‘,  to  u,.  This  generates  the  tuple  f}(wr)  with; 

(1)  j8(w'Xt/]=a(iv),  and 

(2)  fi[w^{A,B,Ab  •  •  •  . Am,C,D\. 

It  follows  that  /?( w'XrtJ  consists  solely  of  constants  and  is  not  in  p(R\),  since  *v  is  not  in  T.  So  p 
is  incomplete  with  respect  to  D'. 

Consider  now  a  chase  of  Tp  by  D'.  We  claim  that  for  any  tuple  /'  generated  by  a  td  <S'.v'> 
in  a  chase  of  Tp  by  D\  one  can  generate  a  tuple  t  by  the  td  <S,v>  in  a  chase  of  T  by  D  such 


(1)  /'[!/]=«(/). 

(2)  t'[A]  and  /'[fl]  arc  the  same  constant,  and 

(3)  t\A  J. ....  t'[Am\,  /'(C],  and  t'[D\  are  distinct  variables. 

We  leave  the  verification  of  this  claim  to  the  reader.  If  p  is  incomplete  with  respect  D'  then  the  td 
<T',w“>  must  be  applied  with  some  valuation  fi.  But,  since  w't[A]=: w'jlAj],  w\  cannot  be 
mapped  to  any  other  tuple  but  ut.  Thus  /J  necessarily  agrees  with  a  on  the  variables  that  arc  in  T. 
In  particular,  /?(w'Xf/]=a(H'),  so  w'o  must  be  mapped  to  a  tuple  u  generated  by  the  chase  of  Tp 
such  that  «ll/]=a(w).  But  then  w  is  generated  by  a  chase  of  T  by  D,  so  D  \=d. 

To  complete  the  proof  we  note  that  the  reduction  from  D  and  d  to  p  and  D'  is  polynomial. 


We  note  that  in  the  proofs  of  Theorem  8  and  9  we  have  untyped  dependencies  in  D'  even  if 
the  dependencies  in  D  are  typed.  Wc  believe  that  the  exponential  lower  bounds  for  consistency 
and  completeness  hold  also  for  typed  dependencies. 

5.  Testing  Satisfaction  under  Embedded  Dependencies 

In  the  previous  section,  we  restricted  our  attention  to  full  dependencies  in  order  to  obtain 
decidability  results.  In  this  section  we  return  to  arbitrary  sets  of  dependencies.  Our  main  result  will 
be  that  both  consistency  and  completeness  are  undecidable  in  this  general  setting.  To  show  this,  we 
will  prove  that  consistency  and  completeness  arc  recursively  equivalent  to  certain  dependency 
implication  problems  which  are  known  to  be  undecidable. 

The  first  step  is  to  reduce  consistency  to  the  implication  problem  of  egd’s  by  a  set  of  depen¬ 
dencies.  Let  p  be  a  state  and  D  a  set  of  arbitrary  dependencies.  Construct  a  set  of  egd’s  Ep  as  fol¬ 
lows.  Let  T  =  j >(Tp)  be  an  isomorphic  image  of  Tp  in  which  no  constants  appear.  For  every  pair 
of  distinct  constants  c  and  d  in  Tp,  <T,(v(c  ),v(d))>  is  an  clement  of  Ep. 

Theorem  10:  p  is  consistent  with  D  if  and  only  if  for  no  cgd  e  €  Ep  is  it  the  case  that  D  |=  e. 

Proof:  Suppose  p  is  consistent.  Let  /  €  wi-AKfD.p).  We  can  construct  from  v  a  homomor¬ 
phism  with  ij(T)C  /  and  for  each  constant  c  of  p,  tj(*»(c))  =  c.  I  certainly  satisfies  D,  but  it 
violates  each  cgd  in  Ep.  Therefore  no  element  of  Ep  is  implic  by  D. 

For  the  converse,  suppose  there  is  no  e  €  Ep  such  that  D  (=  e.  Let  Ep  =  (ci . <?*}  and 

ei  =  <T%(v(cj)Xdj))>.  Construct  from  T  an  atomic  sentence  r  by  letting  t  be  the  conjunction  of 
all  sentences  U{i)  such  that  t  is  a  tuple  in  7 .  Now  consider  the  sentence  d  given  by 


mu! 


3x(tAi»(c1)*i>W1)a  •  •  •  ^nck)*v(dk)), 

where  x  is  a  sequence  of  all  the  variables  in  T.  We  claim  that  D'  =  D  (J  d  is  finitely  satisfiable. 
Suppose  not;  then  D  }=  ~>d.  Now  ~^d  is  a  disjunctive  egd  of  the  form 

Y  x(r-*i»(ci)=i'(di)v  •  •  •  v  v(ck)=p(dk)). 

We  now  rely  on  a  finite  version  of  a  theorem  of  McKinsey  [McK]  due  to  Graham  and  Vardi  [GV] 
to  conclude  that  for  some  1  £i<k,  D  V  x(? -*i'(cl)=i>(di),  that  is,  D  £=  eh  contradicting  our 
assumption  that  D  does  not  imply  e  for  any  e  €  Ep. 

Since  D’  is  finitely  satisfiable,  it  has  a  finite  model  M.  Let  j(c)  be  the  domain  element 
assigned  to  each  variable  p(c)  of  d  to  make  d  true  in  M.  Note  that  s(c,)*  s(d,)  for  every  ch  dt 
appearing  in  an  inequality  in  d,  so  we  can  assume  without  loss  of  generality  that  s(c)-c.  Then 
M(£/)  is  a  weak  instance  for  p,  showing  that  p  is  consistent  □ 

We  now  reduce  the  implication  problem  for  egd’s  to  the  consistency  problem.  Let  D  be  any 
set  of  dependencies  and  let  e  =  <T,(a,b)>  be  an  egd.  We  form  the  set  Re  of  states  of  the  universal 
scheme  {U}  as  follows.  For  each  mapping  r  from  the  symbols  of  T  to  constants  such  that 
p(a)*p(b),  v{T)  is  a  member  of  Rt. 

Theorem  11:  D  ►=  e  if  and  only  if  no  state  in  Re  is  consistent  with  D. 

Proof:  Suppose  D  \=  e,  and  let  p(T)  be  any  state  in  Rt.  Q early  v(T)  violates  e.  Any  weak 
instance  in  weak(D,p(T)  must  satisfy  D,  and  hence  e.  But  w{T)  would  have  to  be  a  subset  of  such 
a  weak  instance,  which  is  impossible.  Hence  no  such  weak  instance  exists  and  v{T)  is  inconsistent 
with  D. 

For  the  converse,  suppose  D  does  not  imply  e.  Let  I  be  any  relation  that  satisfies  D  but  not 
e.  Such  a  relation  must  contain  a  homomorphic  image  v(T)  of  7,  such  that  v(a)*v(b).  Hence  /  is 
a  weak  instance  for  »(T),  which  is  an  clement  of  Rt.  □ 

From  the  last  two  theorems  we  obtain  the  following  immediate  corollary,  relating  the  decida¬ 
bility  of  the  membership  problem  for  an  egd  from  a  set  of  dependencies  and  the  decidability  of 
consistency  under  that  set  of  dependencies. 

Corollary  3:  Let  D  be  a  set  of  dependencies.  Let  Dt  be  the  set  of  cgd's  implied  by  D.  The 
following  are  equivalent. 


(a)  D,  is  recursive. 

(b)  For  every  database  scheme  R  over  the  universe  on  which  D  is  defined,  the  consistency  of 

every  state  of  R  is  decidable. 

(c)  The  consistency  of  every  state  of  the  universal  scheme  of  D  is  decidable. 

Proof:  (a)  implies  (b)  follows  from  Theorem  10;  (b)  implies  (c)  is  immediate;  (c)  implies  (a) 
follows  from  Theorem  11.  □ 

The  development  of  the  last  two  theorems  can  be  repeated  to  relate  completeness  to  td  impli¬ 
cation.  For  the  analogue  to  Theorem  10  construct  a  set  of  exponentially  many  td's,  Gp,  from  a  state 
p.  Elements  of  Gp  are  of  the  form  <T,w>,  where  T  is  the  image  of  Tp  under  an  injection  v  to  vari¬ 
ables,  and  w  is  constructed  as  follows.  Let  R(  be  a  relation  scheme  in  the  given  database  scheme, 
and  let  /  be  a  tuple  on  Rt  such  that  /  consists  of  constants  taken  from  p  but  t€p(Rt).  Then 
w[Rj]=*(<)  and  the  rest  of  w  consists  of  distinct  new  variables.  Informally,  each  element  <T,w> 
of  Gp  says  that  a  containing  instance  for  p  must  contain  a  tuple  w  such  that  its  projection  on  some 
relation  scheme  is  not  in  p.  Note  that  Gp  is  a  set  of  embedded  td’s. 

Theorem  12:  p  is  complete  with  respect  to  D  if  and  only  if  for  no  element  g  €  Gp  is  it  the 
case  that  D  g. 

Proof:  If  there  is  some  g  =<7\*v>  €  Gp  such  that  D  |=  g,  then  let  R  be  tire  relation  scheme 
that  led  to  include  g  in  Gp.  There  is  some  tuple  /  constructed  with  values  from  p  that  docs  not 
appear  in  p(R).  By  property  (3)  of  D,  we  know  that  if  D  f=  g  then  D  |=  g.  Let  /  be  an  clement 
of  WEAK(O.p).  Since  /  satisfies  D,  it  satisfies  g.  Let  p  be  a  valuation  such  that  p(Tp)  C  /.  Since 
T  is  the  image  of  Tp  under  an  injection  v  and  I  satisfies  g,  /  must  contain  some  tuple  whose  pro¬ 
jection  on  R  is  I.  It  follows  that  /  €  D  _  {wr (/)},  so  p  is  incomplete. 

/€WEAK(0,p) 

For  the  converse,  suppose  that  no  g  €  Gp  is  implied  by  D.  By  property  (3)  of  D,  no  g  €  Gp 
is  implied  by  D.  Thus,  for  every  g  €  Gp,  there  is  a  universal  relation  lg  such  that  lg  satisfies  D  but 
not  g.  Let  g  =  <7,,»v>  and  let  /  be  the  tuple  on  relation  scheme  R  that  led  to  the  inclusion  of  g  in 
Gp.  There  is  a  valuation  r  such  that  v(T)C  Ig  and  p(w[/?])£  vg(lg).  By  the  multiple  copies  con¬ 
struction  of  Theorem  2,  we  can  assume  that  v  is  injective.  Thus  we  can  assume  without  loss  of  gen¬ 
erality  that  /,  €  WEAK(O.p),  and  t  £  nn(Ig).  It  follows  that  t  £  p+(R).  Since  this  is  true  for  every 
tuple  I  constructed  from  values  in  p  but  not  itself  in  p(R),  it  follows  that  p  is  complete.  □ 

For  the  analogue  of  Theorem  11,  let  D  be  a  set  of  dependencies  and  g  =  <T,w>  a  td.  We 
may  assume  wiT,  else  g  is  trivial.  Let  U  be  the  relation  scheme  of  Z)Ug,  let 


R  *{A  |  w[A]  occurs  in  T],  and  let  R={U,R  }.  Let »  be  an  injection  from  variables  of  T  to  con¬ 
stants.  Let  S  be  the  set  of  all  relations  on  U  constructed  from  values  in  »(T)  that  contain  r(T). 
Let  if  be  the  set  of  all  states  of  the  form  »*(/■),  where  r  €  S  and  w*(r)  does  not  contain  r(w). 

Theorem  13:  D  g  if  and  only  if  every  state  of  K  is  incomplete. 

Proof:  Again  the  proof  parallels  that  of  Theorem  11.  If  D  g,  then  the  completion  o+  of  a 
for  each  a  €  K  is  such  that  r(wX/?  ]  €  a+(R ),  but  r(w)  €  a(R ),  hence  every  such  a  is  incomplete. 

Conversely,  if  D  does  not  imply  g,  let  /  be  a  relation  on  U  that  satisfies  D  but  not  g.  I  is  a 
weak  instance  for  die  state  a  =  wg(/)  in  K.  Since  a  is  exactly  the  projection  of  one  of  its  weak 
instances,  it  must  be  complete.  □ 

Corollary  4:  Let  D,  be  the  set  of  all  td’s  implied  by  a  set  of  dependencies  D.  D,  is  recursive 
if  and  only  if  completeness  of  any  state  of  any  database  scheme  over  the  attributes  of  D  is  decid¬ 
able.  □ 

We  now  state  the  main  result  of  this  section,  which  is  a  corollary  of  the  four  theorems  above. 

Theorem  14:  There  does  not  exist  an  algorithm  which  will  determine  for  every  pair  <D,p> 
whether  p  is  consistent  nor  whether  p  is  complete  with  respect  to  D. 

Proof:  The  implication  problem  of  egd’s  from  aibitrary  dependencies  was  shown  undccidable 
by  Vardi  [ValJ.  Implication  of  td’s  was  shown  undecidable  by  Vardi  [Va3]  and  Gurevich  and  Lewis 
fGLJ.  □ 

Since  no  general  algorithms  exist  for  deciding  either  completeness  or  consistency,  we  become 
interested  in  solvable  subcases.  If  implication  is  decidable  for  D,  for  example,  if  D  contains  only 
foD  dependencies,  consistency  and  completeness  arc  decidable,  as  shown  in  the  previous  section. 
But  there  may  be  specific  database  schemes  for  which  consistency  and  completeness  arc  decidable, 
even  if  implication  is  not  However,  there  is  no  algorithm  that  will  decide,  given  a  database  scheme 
and  a  set  of  dependencies,  whether  consistency  and  completeness  arc  decidable  for  that  scheme  and 
those  dependencies. 

Theorem  IS:  The  set  A  =  {<Z),R>  |  consistency  and  completeness  of  states  of  R  with  respect 
to  D  are  decidable  }  is  not  recursive. 

Proof:  Vardi  (Va2]  showed  that  it  is  undccidable  whether  the  implication  problem  for  a  set 
of  dependencies  D  is  decidable  or  not.  Suppose  A  were  recursive.  For  a  fixed  set  of  dependencies 
D,  the  predicate  of  these  dependencies  has  a  fixed  arity,  that  is,  a  fixed  set  of  attributes.  There  arc 
only  finitely  many  database  schemes  over  this  set  of  attributes.  As  implication  is  decidable  for  D  if 


and  only  if  <D,R>€  A  for  each  of  these  finitely  many  R’s,  an  algorithm  for  membership  in  A 
would  yield  an  algorithm  to  test  decidability  of  the  implication  problem.  So  A  cannot  be  recursive. 

□ 

6.  Discarding  the  Universal  Relation  Scheme 

The  sentences  in  Cp  and  Kp  use  a  predicate  letter  corresponding  to  the  universal  relation 
scheme  for  the  database.  It  is  interesting  to  ask  whether  we  can  construct  a  theory  with  properties 
similar  to  Theorems  4  and  5,  but  using  only  predicate  symbols  Ry  . . . ,  Rn  of  the  same  arities  and 
sorts  as  the  relation  schemes,  thus  avoiding  the  universal  predicate.  This  amounts  to  asking 
whether  dependency  satisfaction  can  be  expressed  in  a  “local”  way,  without  having  to  resort  to  the 
existence  of  a  universal  relation. 

There  is  a  special  case,  the  independent  schemes,  when  the  question  can  clearly  be  answered 
in  the  afTirmadve.  Given  a  set  of  dependencies  D  on  a  database  scheme  {Ry ... ,  R„},  the  pro¬ 
jected  dependencies  D,  are  all  the  dependencies  that  must  hold  in  any  relation  r,  on  fl,  such  that 
n  =  vRl(r\  where  r  is  a  universal  relation  satisfying  D.  A  state  p  is  called  locally  satisfying  if  every 
p(Rt)  satisfies  Dh  A  database  scheme  is  said  to  be  independent  if  every  locally  satisfying  state  is 
consistent  with  D.  When  the  database  scheme  is  independent,  we  can  write  down  the  required  set 
of  sentences  by  expressing  each  dependency  in  the  context  of  some  Rh  For  special  cases  such  as 
functional  and  multivalued  dependencies,  projected  dependencies  can  be  easily  characterized  in 
terms  of  the  original  set  D,  although  finding  the  D,' s  is  computationally  hard  [H].  For  more  general 
classes  of  dependencies,  we  do  not  even  know  if  the  D/’s  arc  finite.  In  the  general  case,  die  results 
in  this  section  should  be  viewed  as  existence  proofs  for  the  desired  sets  of  sentences,  rather  than 
effective  constructions. 

Our  main  result  in  this  section  is  that  in  fact  it  is  possible  to  construct  a  set  of  sentences  with 
the  desired  properties  when  the  database  scheme  is  weakly  cover  embedding.  To  define  this  notion, 
note  that  the  projected  dependencies  Dt  can  be  viewed  as  embedded  dependencies  on  U.  For  Dt 
defined  on  Rit  we  say  a  relation  on  U  satisfies  D/  if  w*((/)  does.  We  say  that  a  database  scheme  R 

If 

weakly  cover  embeds  a  set  of  dependencies  D  if  any  state  of  R  consistent  with  jj  D(  is  consistent 

<=i 

with  D.  In  the  framework  of  Section  4,  in  a  weakly  cover  embedding  scheme  it  suffices  to  chase 
using  only  dependencies  local  to  some  relation  scheme  of  R. 


-30* 


It  is  easy  to  see  that  the  class  of  weakly  cover  embedding  schemes  contains  both  the  cover 
embedding  or  dependency  preserving  schemes  [MMSU]  and  the  independent  schemes.  Since,  for 

fl 

cover  embedding  schemes,  we  have  (J  Dt  (=  D,  such  schemes  are  weakly  cover  embedding.  Since 

/=i 

ft 

any  state  consistent  with  (J  £>,  is  locally  consistent,  the  independent  schemes  are  weakly  cover 
1=1 

embedding.  A  polynomial  time  algorithm  for  testing  whether  a  weakly  cover  embedding  database 
scheme  is  independent,  in  the  case  where  all  dependencies  are  fd’s,  is  given  in  [GY].  Even  for  this 
restricted  case,  no  algorithm  to  test  whether  a  scheme  is  weakly  cover  embedding  is  known. 

Given  a  state  p  of  a  weakly  cover  embedding  database  scheme,  we  construct  a  new  set  of  sen¬ 
tences  Bp  as  follows.  The  language  of  Bp  is  the  same  as  the  language  of  A p,  except  that  we  do  not 
use  the  universal  predicate  letter  U .  Bp  contains  four  kinds  of  sentences. 

•  State  axioms:  for  each  Rt  and  each  tuple  i€p(Ri),  the  sentence  Rt(t). 

•  Join-consistency  axioms:  for  each  /?,  in  the  database  scheme,  Bp  contains  the  sentence 

Y  x(R,ix)-*Gbi  ■  •  •  b*Xl?i(v •  •  •  KRn(i„))) 


where  V/=x  and  the  v’s  arc  constructed  from  values  in  x  and  the  b's  so  that  for  all 
l<p,q<,n,  if  the  jth  attribute  of  Rp  is  the  A th  attribute  of  Rq,  then  vp[/']=v,[/t].  Intui¬ 
tively,  the  join-consistcncy  axioms,  together  with  the  state  containment  axioms,  assert  the 
existence  of  a  join-consistent  state  that  contains  p. 

•  Dependencies:  for  each  /,  the  set  of  dependencies  Dt  can  be  rewritten  as  a  set  of  first  order 
sentences  on  Rh 

•  Distinctness  axioms:  as  before,  these  assert  that  all  constants  are  distinct. 

Example  S:  We  construct  Bp  for  the  state  of  Examples  1  and  4.  The  universe  is 
U  =  { S,C,R,H },  the  relation  schemes  R\  =  SC,  R2  =  CRH,  Rj  =  SRH,  and  the  dependencies 
SH-*R,  RH-*C.  The  projected  dependencies  are:  D\=0,  D2={RH-*C},  D2={SH~*R}. 

State  axioms: 

R\(Jaek,CS31S) 

Kj(CS37M215,M10) 

RjiCS 378.B213.fK10) 


Vy; 


.*»  „**  .  p  , ,* 


*jUicM215,A/10) 


Join-consistency  axioms: 

(^x^xi){Ri(xix^-*(^bib^R^X2bib^\R3(xibib^)) 
(Yxxx2xj X/f  Axlx2xi)-,,(lb1)(R  libix^R  3(61X2X3))) 
(YxiXiXjX/f  ^JtiJf2JcjH(3  b{KR  lixibO^R  2(61X2X3))) 

Dependencies: 

(YriAiC1c2X/?2(ci'-iAiV'/?2(c2r1ALi-»Ci=c2) 

(Ys1rir2A1XJ?3(*lri*i)A^j(il^l)”*''l=',2) 


Distinctness  axioms: 
*215**213 
MIQ*W10 


etc. 


Theorem  16:  For  a  weakly  cover  embedding  database  scheme,  Bp  is  finitely  satisfiable  if  and 
only  if  p  is  consistent  with  D. 


Proof:  (10  Suppose  p  is  consistent,  and  let  7€wnAK(£>,p).  Let  r(  =«*.(/)  for  each  i.  Con¬ 
sider  the  interpretation  for  Bp  where  each  constant  is  mappped  to  itself  and  Rt  is  interpreted  as  r( 
for  each  i  By  definition  of  projected  dependencies,  each  r,  satisfies  D,.  Since  /  is  a  containing 
instance  for  p,  the  rj's  satisfy  the  state  containment  axioms.  Finally,  the  r,’s  arc  join-consistent  by 
construction  and  thus  satisfy  the  join-consistency  axioms.  It  follows  that  we  have  a  finite  model  for 
Bp- 


(Only  i0  Suppose  Bp  is  finitely  satisfiable.  Proceeding  as  in  Theorem  1,  we  can  assume  that 
there  is  a  database  state  p*  that  satisfies  Bp.  That  is,  p  contains  p,  is  join  consistent,  and  p\Rt) 
satisfies  D,  for  each  i.  Ijet  /  be  a  universal  relation  such  that  wr (/)=p\  Clearly  /  satisfies  the 


Dt\  so  / €weak((J  Z)(-,p'),  showing  that  p'  is  consistent  with  (J  /),.  Since  R  is  weakly  cover 

i  i 

embedding,  p'  is  also  consistent  with  D.  Since  pCp',  weak (Z),p')  CWEAK(/),p),  so  p  is  consistent 
with  D.  □ 

The  following  example  shows  that  the  construction  above  does  not  generalize  to  non-cover 
embedding  schemes;  we  leave  open  die  question  of  whether  such  a  set  of  sentences  can  be  con¬ 
structed  at  all  for  arbitrary  schemes. 

Example  6:  Let  R={AC,BC},  D  ={AB-*C,C~*B},  p(AC)={<01>, <02>}, 

p(BC)={< 31>,<32>}.  Note  that  Z)j  =  0,  Di  =  {C-*B}.  It  is  easy  to  see  that  p  is  consistent  with 
fliU  but  not  consistent  with  D.  However,  Bp  is  consistent  in  this  example,  since  p  is  a  join- 
consistent  state  that  satisfies  the  local  dependencies  and  hence  provides  a  model  for  Bp. 

7.  Discussion 

We  have  pointed  out  that  there  are  two  separate  sides  to  the  standard  notion  of  dependency 
satisfaction:  consistency  and  completeness.  We  view  consistency  as  corresponding  to  a  “lazy  evalua¬ 
tion”  tactic  for  constraint  maintenance.  As  long  as  no  violations  can  be  proven,  the  state  is  con¬ 
sidered  legal.  The  derived  tuples  not  present  in  the  state  can  be  generated  on  demand,  for  purposes 
such  as  query  answering.  Note  the  similarity  of  this  policy  to  the  “deductive  databases”  approach 
[GM],  where  any  fact  deducible  from  the  stored  relations  is  considered  part  of  the  database. 
Requiring  both  consistency  and  completeness  corresponds  to  a  constraint  maintenance  policy  that 
guarantees  that  all  derived  tuples  will  be  present  in  the  database  at  all  times.  There  is  a  storage- 
computation  tradeoff  in  the  choice  of  a  policy.  This  tradeoff  applies  not  only  to  multi-relation  data¬ 
bases  but  also  to  single  relations.  Consistency  of  a  relation  under  a  set  of,  say,  fd’s  and  mvd’s,  is 
strictly  weaker  than  standard  satisfaction. 

The  combination  of  our  notions  of  satisfaction  with  the  concept  of  independence  leads  to 
interesting  questions.  For  example,  what  arc  the  database  schemes  such  that  every  locally  consistent 
state  is  consistent  and  complete?  Chan  and  Mcndclzon  [CMJ  have  characterized  these  schemes 
when  the  join  dependency  for  the  database  scheme  and  a  set  of  functional  dependencies  are  given. 
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